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INTRODUCTION 


Tuberous  sclerosis  complex  (TSC)  is  a  neurological  disorder  characterized  by  the  formation 
of  hamartomas  in  several  organs,  including  brain,  skin,  kidney,  heart,  and  liver  (1-2).  This 
syndrome  often  manifests  in  early  age  with  infantile  seizures  and  patients  may  have  mental 
retardation  and  autism.  Brain  lesions  include  cerebral  cortical  tubers,  subependymal  nodules,  and 
retinal  hamartomas.  Patients  often  develop  skin  lesions,  including  facial  angiofibromas, 
hypomelanotic  macules,  and  shagreen  patches.  In  the  kidney,  benign  angiomyolipomas  occur 
frequently,  whereas  malignant  angiomyolipomas  and  renal  cell  carcinomas  are  less  common. 
Brain  and  renal  lesions  cause  the  highest  morbidity  and  mortality  among  TSC  patients  (1-2). 

The  tumor  suppressor  genes  TSC1  and  TSC2  encode  the  proteins  hamartin  and  tuberin, 
respectively  (3,4),  and  their  mutagenic  inactivation  occurs  in  a  wide  variety  of  hamartomas  in 
TSC  patients.  Hamartin  and  tuberin  are  expressed  in  the  same  cell  types  within  multiple  organs 
including  the  kidney,  brain,  lung  and  pancreas,  and  interact  directly  forming  a  heterodimer  (5). 
Hamartin  is  an  1164-amino  acid  hydrophilic  protein  expressed  in  most  human  tissues,  and  has  a 
single  transmembrane  domain  (amino  acids  127-144)  and  a  predicted  coiled-coil  region  at 
residues  719-998.  Tuberin  is  an  1807-amino  acid  protein  and  contains  a  GAP  domain  spanning 
residues  1517-1674  that  inhibits  the  G  protein  Ras  homolog  enriched  in  brain  (Rheb)  (6,7).  Rheb 
activates  the  mammalian  target  of  Rapamycin  (mTOR)  and  inactivates  B-Raf  (7).  Activation  of 
mTOR  leads  to  increased  protein  synthesis.  Upon  insulin  or  growth  factor  stimulation,  tuberin  is 
phosphorylated  by  several  kinases,  including  AKT/PKB,  leading  to  inhibition  of  its  GAP  activity 
and  activation  of  mTOR.  Several  unrelated  patients  with  tuberous  sclerosis  have  point  mutations 
in  the  TSC2  GAP  domain  that  influence  the  stability  of  the  TSC1-TSC2  complex  (8,9). 

The  tuberin  region  spanning  residues  1-418  contains  a  hamartin-binding  domain  (Figure 
1A),  which  is  necessary  to  maintain  hamartin  in  a  soluble  form  in  the  cytosol  (9).  Within  this 
region,  amino  acids  346-371  are  predicted  to  form  a  putative  coiled-coil,  which  is  necessary  but 
not  sufficient  to  mediate  the  interaction  with  hamartin  because  additional  N-terminal  residues  are 
also  required  for  the  efficient  formation  of  the  TSC1-TSC2  complex  (9).  The  hamartin  region 
spanning  amino  acids  302-430  mediates  the  interaction  with  tuberin  (9).  Importantly,  certain 
non-truncating  mutations  in  patients  with  TSC  disrupt  the  hamartin-tuberin  interaction  (9).  For 
example,  non-truncating  TSC-causing  mutations  located  within  the  binding  region  of  hamartin 
(N198_F199delinsI;593-595delACT)  or  tuberin  (G294E  and  I365del),  abolish  or  dramatically 
reduce  the  interaction  of  these  proteins.  In  contrast,  non-pathogenic  missense  polymorphisms  of 
tuberin  (R261W,  M286V,  R367Q)  in  the  same  region  as  the  disease-causing  TSC2  mutations  do 
not  affect  this  interaction,  indicating  that  physical  association  of  hamartin  with  tuberin  is 
required  for  the  growth  suppressing  functions  of  these  proteins  (9). 

BODY 


During  the  one-year  award  period  we  proceeded  with  the  experiments  proposed  in  the  Tasks 
of  the  original  application.  A  description  of  our  progress  in  these  studies  follows  below. 

Task  1,  To  express,  purify,  and  crystallize  the  hamartin  and  tuberin  interaction 
domains.  To  crystallize  the  hamartin(302-430)  and  tuberin(l-418)  domains  it  is  necessary  to 
produce  large  amounts  (multi-milligram  quantities)  of  these  protein  fragments  in  bacterial  cells 
in  soluble  form  and  purified  them  to  homogeneity.  Toward  this  goal,  the  DNA  fragments 
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encoding  the  human  hamartin(302-430)  and  tuberin(l-418)  regions  were  amplified  by  the 
polymerase  chain  reaction  (PCR)  using  the  human  hamartin  and  tuberin  cDNAs  as  templates, 
respectively,  (kind  gifts  from  Dr.  Angela  Hodges,  MRC  Centre  for  Neurodegeneration 
Research,  King's  College,  London)  and  cloned  them  following  standard  protocols  (10)  into 
pET-6H2,  a  modified  pET-1 6b  prokaryotic  vector  (Novagen)  that  allows  the  expression  of 
recombinant  hexahistidine-tagged  proteins  in  Escherichia  coli  BL21(DE3)  cells,  as  we  described 
previously  (11).  Numerous  attempts  to  produce  large  amounts  of  soluble  hamartin(302-430)  and 
tuberin(l-418)  proteins  in  several  E.  coli  strains  using  this  vector  in  a  wide  range  of  temperatures 
and  in  different  culture  media  failed  to  produce  large  amounts  of  soluble  proteins  necessary  for 
the  crystallization  experiments.  The  encountered  difficulties  to  produce  large  amounts  of  soluble 
recombinant  proteins  delayed  the  successful  completion  of  the  proposed  tasks  within  the  one- 
year  award  period.  To  solve  this  problem,  we  used  PCR-mediated  site-directed  mutagenesis  to 
optimize  many  rare  arginine  codons  in  the  open  reading  frames  of  hamartin(302-430)  and 
tuberin(l-418)  regions,  and  cloned  the  resulting  DNA  fragments  into  a  series  of  prokaryotic 
expression  vectors,  including  pGEX-2  (Pharmacia)  to  produce  them  as  fusions  with  glutathione 
S-transferase,  pET-43a  (Novagen)  to  produce  them  as  fusions  with  NusA,  pMAL-c2  (New 
England  BioLabs)  to  produce  them  as  fusions  with  maltose-binding  protein,  and  pTYB12  (NEB) 
to  produce  them  as  fusions  with  intein.  The  resulting  constructs  were  verified  by  DNA 
sequencing  and  were  used  to  transform  E.  coli  BL21(DE3)  cells.  Extensive  tests  with  these 
constructs  resulted  in  the  production  of  soluble  hamartin(302-430)  and  tuberin(l-418)  proteins. 
The  hamartin(3 02-430)  and  tuberin(l-418)  domains  were  purified  using  a  combination  of 
affinity,  ion  exchange,  and  size  exclusion  chromatography  (Figure  1B,C).  Crystallization 
experiments  were  performed  with  the  pure  proteins  using  sparse  matrix  screenings  and  vapor 
diffusion  crystallization  methods.  Needle-like  microcrystals  of  the  hamartin(3 02-430)  protein 
were  obtained  in  several  days  at  20°C  using  the  sitting  drop  vapor  diffusion  method  (Figure  ID). 
These  crystals  are  too  small  for  analysis  and  experiments  are  in  progress  to  optimize  the 
crystallization  conditions.  Likewise,  microcrystals  of  the  tuberin(l-418)  protein  were  obtained  at 
20°C  using  the  sitting  drop  vapor  diffusion  method  (Figure  IE).  These  crystals  diffract  to  low 
resolution  (Figure  IF)  and  we  are  in  the  process  of  optimizing  the  crystallization  conditions  in 
order  to  obtain  larger  crystals  that  diffract  to  higher  resolution. 

Task  2.  To  determine  the  crystal  structures  of  the  hamartin(302-430)  and  tuberin(l- 
418)  domains.  Because  the  hamartin(302-430)  and  tuberin(l-418)  proteins  interact  to  form  a 
heterodimer,  we  assembled  this  complex  by  mixing  equimolar  amounts  of  these  proteins  and 
performed  crystallization  experiments.  Despite  numerous  crystallization  trials,  no  crystals  of  the 
hamartin(302-430)-tuberin(l-418)  protein  complex  have  been  obtained  to  date.  We  will  proceed 
with  the  crystallization  of  this  complex  by  taking  special  care  to  purify  monodisperse  protein 
complexes  for  crystallization  trials  using  size  exclusion  chromatography.  For  structure 
determination,  crystals  of  selenomethionine-containing  recombinant  hamartin(302-430)  and 
tuberin(l-418)  proteins  will  be  used  to  collect  multiwavelength  anomalous  diffraction  (MAD) 
diffraction  data  using  synchrotron  radiation.  The  crystal  structures  will  be  determined  with  MAD 
methods.  I  would  like  to  emphasize  that  although  the  award  period  is  finished,  we  will 
continue  to  pursue  very  persistently  the  crystallographic  analysis  of  the  hamartin(302-430) 
and  tuberin(l-418)  proteins,  as  outlined  in  the  original  proposal.  The  one-year  Concept 
Award  has  provided  critical  support  to  initiate  this  research  project  and  because  of  our  progress 
in  producing  these  proteins  and  initiating  crystallization  experiments,  we  plan  to  expand  on  these 
studies  and  apply  this  year  to  the  Department  of  Defense  for  an  Idea  Development  Award. 
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Figure  1.  (A)  Schematic  representation  of  the  human  proteins  hamartin  and  tuberin  (not  to  scale) 
and  their  domains  proposed  to  be  analyzed  crystallographically.  Numbers  indicate  amino  acid 
residues.  (B)  SDS-PAGE  analysis  of  the  hamartin(302-430)  domain  expressed  in  E.  coli 
BL21(DE3)  cells  and  purified  using  affinity,  ion  exchange,  and  size  exclusion  chromatography. 
Lane  M:  protein  markers  in  kDa;  lanes  1-6 :  elution  fractions  of  purified  hamartin(302-430) 
protein.  (C)  SDS-PAGE  analysis  of  the  tuberin(l-418)  domain  expressed  in  E.  coli  BL21(DE3) 
cells  and  purified  using  affinity  chromatography.  Lane  M :  protein  markers  in  kDa;  lanes  1-4: 
elution  fractions  of  tuberin(l-418)  protein.  (D)  Needle-like  microcrystals  of  the  hamartin(302- 
430)  domain  obtained  by  the  sitting  drop  vapor  diffusion  method.  (E)  Microcrystals  of  the 
tuberin(l-418)  domain  obtained  by  the  sitting  drop  vapor  diffusion  method.  (F)  Diffraction 
pattern  of  a  tuberin(l-418)  crystal  collected  on  an  R-AXIS  IV  detector  system  with  3°  oscillation 
frames  exposed  for  30  min  at  room  temperature  and  crystal-to-detector  distance  of  150  mm. 
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KEY  RESEARCH  ACCOMPLISHMENTS 


1.  Cloning  of  the  DNA  fragment  coding  for  the  human  hamartin(302-430)  protein  into 
prokaryotic  expression  vectors  pET-6H2,  pGEX-2,  pET-43a,  pMAL-c2,  and  pTYB12  for 
production  in  E.  coli  BL21(DE3)  cells  ( Task  1 ). 

2.  Cloning  of  the  DNA  fragment  coding  for  the  human  tuberin(l-418)  protein  into  prokaryotic 
expression  vectors  pET-6H2,  pGEX-2,  pET-43a,  pMAL-c2,  and  pTYB12  for  production  in 
E.  coli  BL21(DE3)  cells  {Task  7). 

3.  Induction  of  expression  and  performance  of  solubility  tests  of  the  bacterially  produced 
human  hamartin(302-430)  protein  (Task  7). 

4.  Induction  of  expression  and  performance  of  solubility  tests  of  the  bacterially  produced 
human  tuberin(l-418)  protein  ( Task  1 ). 

5.  Large-scale  production  of  the  human  hamartin(3 02-430)  protein  in  E.  coli  cells  and 
purification  of  this  protein  using  a  combination  of  affinity,  ion  exchange,  and  size  exclusion 
chromatography  ( Task  1 ). 

6.  Large-scale  production  of  the  human  tuberin(l-418)  protein  in  E.  coli  cells  and  purification 
of  this  protein  using  affinity,  ion  exchange,  and  size  exclusion  chromatography  ( Task  1 ). 

7.  Sparse  matrix  crystallization  experiments  with  the  purified  human  hamartin(302-430) 
protein  in  the  apo  form,  using  the  sitting  drop  vapor  diffusion  method  ( Task  1 ). 

8.  Sparse  matrix  crystallization  experiments  with  the  purified  human  tuberin(l-418)  protein  in 
the  apo  form,  using  the  sitting  drop  vapor  diffusion  method  (Task  7). 

9.  Sparse  matrix  crystallization  experiments  with  the  purified  human  hamartin(302-430) 
domain  complexed  with  the  tuberin(l-418)  protein,  using  the  sitting  drop  vapor  diffusion 
method  (Task  7). 

10.  Preliminary  X-ray  diffraction  analysis  of  the  tuberin(l-418)  crystals  with  data  collected  on 
an  R-AXIS  IV  imaging  plate  detector  system  (Task  2). 


REPORTABLE  OUTCOMES 


The  experiments  of  the  proposal  are  still  in  progress  and  no  publications  have  resulted  so  far 
on  the  structural  analysis  of  the  hamartin  and  tuberin.  Partial  support  provided  by  this  award  to 
our  laboratory  was  acknowledged  in  the  following  two  papers  that  we  published  during  the 
award  period: 

1.  Birrane  G.,  Varma  A.K.,  Soni  A.  and  Ladias  J.A.A.  (2007).  Crystal  Structure  of  the  BARD1 
BRCT  Domains.  Biochemistry  46:  7706-7712 

2.  Meiyappan  M.,  Birrane  G.  and  Ladias  J.A.A.  (2007).  Structural  Basis  for  Polyproline 
Recognition  by  the  FE65  WW  Domain.  Journal  of  Molecular  Biology  372:  970-980 


CONCLUSIONS 


During  the  one-year  award  period  we  made  numerous  attempts  to  find  conditions  that  would 
allow  the  production  of  large  amounts  of  the  human  hamartin(302-430)  and  tuberin(l-418) 
protein  fragments  in  E.  coli  cells,  using  the  prokaryotic  expression  vectors  pET-6H2,  pGEX-2, 
pET-43a,  pMAL-c2,  and  pTYB12.  After  solving  several  problems  with  the  solubility  of  these 
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proteins,  we  were  able  to  produce  them  in  a  soluble  form  appropriate  for  structural  studies.  We 
performed  a  large  number  of  crystallization  experiments  with  the  hamartin(3 02-430)  and 
tuberin(l-418)  proteins  in  the  apo  form  and  as  a  heterodimeric  complex.  Microcrystals  of  the 
unbound  hamartin(302-430)  and  tuberin(l-418)  proteins  were  obtained  using  the  sitting  drop 
vapor  diffusion  method  but  the  hamartin(302-430)-tuberin(l-418)  complex  did  not  yield  any 
crystals  to  date.  We  are  currently  in  the  process  of  optimizing  the  crystallization  conditions  of 
these  proteins  using  fractional  factorial  experiments  with  the  goal  to  obtain  diffraction-quality 
crystals.  We  will  continue  to  pursue  these  studies  even  after  the  expiration  of  the  award  period 
because  the  crystal  structures  of  the  hamartin(302-430)  and  tuberin(l-418)  proteins  will  elucidate 
the  molecular  mechanisms  underlying  the  interaction  and  function  of  these  proteins  and  will 
reveal  the  molecular  changes  induced  by  TSC-causing  mutations  in  these  proteins  that  abrogate 
their  interaction  and  contribute  to  the  pathogenesis  of  TSC. 
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abstract:  The  interaction  of  the  breast  tumor  suppressor  BRCA1  with  the  protein  BARD1  results  in  the 
formation  of  a  heterodimeric  complex  that  has  ubiquitin  ligase  activity  and  plays  central  roles  in  cell 
cycle  checkpoint  control  and  DNA  repair.  Both  BRCA1  and  BARD1  possess  a  pair  of  tandem  BRCT 
domains  that  interact  in  a  phosphorylation-dependent  manner  with  target  proteins.  We  determined  the 
crystal  structure  of  the  human  BARD1  BRCT  repeats  (residues  568—777)  at  1.9  A  resolution.  The 
composition  and  structure  of  the  BARD1  phosphoserine-binding  pocket  Pi  are  strikingly  similar  to  those 
of  the  BRCA1  and  MDC1  BRCT  domains,  suggesting  a  similar  mode  of  interaction  with  the  phosphate 
group  of  the  ligand.  By  contrast,  the  BARD1  BRCT  selectivity  pocket  P2  exhibits  distinct  structural  features, 
including  two  prominent  histidine  residues,  His685  and  His686,  which  may  be  important  for  ligand  binding. 
The  protonation  state  of  these  histidines  has  a  marked  effect  on  the  calculated  electrostatic  potential  in 
the  vicinity  of  P2,  raising  the  possibility  that  ligand  recognition  may  be  regulated  by  changes  in  pH. 
Importantly,  the  BARD1  BRCT  structure  provides  insights  into  the  mechanisms  by  which  the  cancer- 
associated  missense  mutations  C645R,  V695L,  and  S761N  may  adversely  affect  the  structure  and  function 
of  BARD1. 


The  BRCA1 -associated  RING  domain  protein  1  (BARD1)1 
is  the  major  binding  partner  of  the  breast  and  ovarian  tumor 
suppressor  BRCA1,  and  it  plays  a  central  role  in  the 
regulation  of  its  stability,  cellular  localization,  and  function 
(7,  2).  The  BRCA1— BARD1  heterodimer  has  ubiquitin 
ligase  activity  and  participates  in  the  maintenance  of  genomic 
stability  through  its  function  in  cell  cycle  checkpoint  control 
and  DNA  repair  (7,  2).  In  addition,  BARD1  mediates 
apoptosis  in  a  p5 3 -dependent,  B RC A 1 -independent  manner, 
and  BARD  1 -repressed  cells  are  resistant  to  apoptosis  ( 3 ,  4). 

Human  BARD1  is  a  777  amino  acid  protein  that  contains 
a  RING  finger  (residues  46—90),  three  ankyrin  repeats 
(residues  420—525),  and  two  tandem  BRCT  domains  (resi¬ 
dues  568— 777)  (5,  6).  Germline  and  somatic  mutations  that 
affect  the  BARD1  structure  occur  in  a  subset  of  breast, 
ovarian,  and  uterine  cancers,  and  some  of  these  mutations 
have  been  identified  in  hereditary  breast  and  ovarian  cancers 
from  patients  negative  for  mutations  in  BRCA1  (7— 9).  Two 
missense  mutations  (C557S  and  Q564H)  occur  in  the  region 
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between  the  ankyrin  repeats  and  the  BRCT  domains,  while 
others  (N295S  and  K312N)  occur  outside  the  known 
domains.  Three  cancer-associated  missense  mutations,  C645R, 
V695L,  and  S761N,  occur  within  the  BARD1  BRCT 
domains  (Figure  1  A).  C645R  has  been  associated  with  breast 
and  ovarian  cancer,  V695L  has  been  found  in  breast  cancer, 
and  S761N  has  been  identified  in  uterine  and  breast  cancer 
(7,  9).  These  mutations  adversely  affect  the  tumor  suppressor 
function  of  BARD1  (9),  but  the  underlying  molecular 
mechanisms  are  not  known. 

BRCT  repeats  are  phosphopeptide-binding  modules  found 
in  a  number  of  proteins  that  function  in  DNA-damage 
checkpoint  control  and  DNA  repair  (10—12).  Recent  crystal¬ 
lographic  studies  of  the  BRCA1  and  MDC1  BRCT  domains 
bound  to  their  target  peptides  offered  mechanistic  insights 
into  ligand  recognition  by  these  modules  (13—17).  Specif¬ 
ically,  a  shallow  pocket  (Pi)  in  the  N-terminal  BRCT  receives 
the  phosphoserine  (pSer)  at  position  0  of  the  ligand,  whereas 
a  deeper  hydrophobic  pocket  (P2)  in  the  groove  between  the 
N-  and  C-terminal  BRCT  repeats  selects  residues  at  position 
+3.  The  amino  acid  composition  and  architecture  of  Pi  are 
similar  among  several  BRCTs,  consistent  with  its  nondis- 
criminatory  binding  to  pSer  0.  In  contrast,  the  P2  pockets 
from  various  BRCT  domains  display  significant  differences 
that  determine  their  selectivity  properties.  The  molecular 
basis  for  the  selection  of  Phe  +3  by  the  BRCA1  BRCTs 
was  revealed  by  the  crystal  stmctures  of  these  domains  bound 
to  the  BACH1  (73,  14)  and  CtIP  (16)  ligands,  whereas  the 
crystal  structure  of  the  MDC1  BRCTs  bound  to  the  C- 
terminal  tail  of  y-H2AX  elucidated  the  mode  of  Tyr  +3 
recognition  (77).  However,  oriented  peptide  library  studies 
provided  evidence  that  the  BARD1  BRCTs  select  for  ligands 
with  the  motif  (pSer  0)-(Asp/Glu  +  l)-(Asp/Glu  +2)-(Glu 
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Figure  1:  (A)  Sequence  alignment  of  the  BARD1,  BRCA1,  and  MDC1  BRCT  domains.  The  protein  sequences  were  aligned  using  the 
program  CLUSTAL  W  (31)  and  manual  intervention.  Hyphens  represent  gaps  inserted  for  optimum  alignment.  The  secondary  structure 
elements  of  BARD1,  assigned  by  the  program  STRIDE  (32),  are  depicted  as  orange  arrows  (^-strands)  and  purple  cylinders  (helices).  The 
linker  a-helix  aL  is  colored  pink.  The  BRCT2  secondary  structure  elements  are  labeled  with  primes.  Residues  lining  the  Pi  and  P2  pockets 
are  shown  as  white  letters  on  blue  and  red  background,  respectively.  Identical  amino  acids  in  two  or  more  sequences  are  shaded  gray,  while 
residues  at  the  interface  of  the  BRCT  repeats  are  highlighted  yellow.  Cancer-associated  missense  mutations  are  encircled.  (B)  Ribbon 
representation  of  the  BARD1  BRCTs.  The  location  of  the  Pi  and  P2  pockets  are  denoted.  Strands  and  helices  are  colored  as  in  (A).  The 
figure  was  made  using  BOBSCRIPT  (33).  (C)  Stick  model  superposition  of  the  Pi  pockets  of  the  BRCA1— CtIP  complex  (PDB  entry 
1Y98)  and  the  unbound  BARD1.  The  CtIP  peptide  is  colored  pink,  the  interacting  residues  of  BRCA1  are  in  green,  and  water  molecules 
are  cyan  spheres.  BARD1  residues  are  colored  gray,  and  an  associated  water  molecule  is  shown  as  a  red  sphere.  Nitrogen,  oxygen,  phosphorus, 
and  sulfur  atoms  are  shown  in  blue,  red,  magenta,  and  yellow,  respectively.  Hydrogen  bonds  are  dashed  lines.  The  orientation  of  the  Pi 
pocket  is  identical  to  that  of  (B).  (D)  A  weighted  2 F0  —  Fc  electron  density  map  of  residues  lining  the  P2  pocket  calculated  at  1.9  A 
resolution  and  contoured  at  l.Ocr.  Atoms  are  colored  as  in  (C).  For  clarity,  the  model  has  been  rotated  approximately  90°  in  relation  to  (B) 
such  that  the  view  is  from  underneath  the  P2  pocket.  The  figure  was  made  using  POVSCRIPT  (34)  and  POV-Ray  (www.povray.org). 
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+3)  (72),  suggesting  that  the  BARD1  selectivity  determinants 
are  different  from  those  of  other  BRCT  domains. 

To  elucidate  the  molecular  mechanisms  underlying  the 
function  of  the  BARD1  BRCT  repeats,  we  have  determined 
the  crystal  structure  of  these  domains  in  the  apo  form. 
Comparison  with  the  BRCA1  and  MDC1  BRCTs  reveals 
distinct  structural  differences  in  the  C-terminal  BRCT  of 
BARD1,  the  most  notable  of  which  is  the  presence  of  His685 
and  His686  at  the  selectivity  pocket  P2,  suggesting  their 
involvement  in  ligand  recognition.  In  addition,  the  present 
work  provides  insights  into  the  mechanisms  by  which  cancer- 
associated  mis  sense  mutations  within  the  BRCT  modules 
have  detrimental  effects  on  BARD1  function. 

EXPERIMENTAL  PROCEDURES 

Protein  Purification  and  Crystallization.  A  DNA  fragment 
encoding  the  human  BARD1  BRCT  domains  (residues  568— 
111)  was  amplified  using  the  polymerase  chain  reaction  and 
cloned  into  a  modified  pET-6H  vector  (18).  The  BARD1 
BRCT  protein  carrying  an  N-terminal  hexahistidine  tag 
followed  by  a  site  for  the  human  rhino  virus  3C  protease 
(EVLFQiGP)  was  expressed  in  Escherichia  coli  BL21(DE3) 
cells  grown  at  37  °C  until  they  reached  an  absorbance  at 
600  nm  of  0.5,  followed  by  induction  with  0.5  mM  IPTG  at 
20  °C  for  20  h.  The  cells  were  harvested  by  centrifugation, 
resuspended  in  phosphate-buffered  saline  supplemented  with 
protease  inhibitor  cocktail  tablets  (Roche  Applied  Science) 
on  ice,  and  lysed  on  an  EmulsiFlex-C3  homogenizer  (Aves- 
tin).  Although  the  majority  of  the  recombinant  BRCT  protein 
was  found  in  the  insoluble  fraction,  an  amount  adequate  for 
structural  studies  was  recovered  as  soluble  protein  (~4  mg/L 
of  culture).  Soluble  6His-BRCT  protein  was  purified  on  Ni- 
NTA  resin  (Qiagen),  eluted  with  200  mM  imidazole,  and 
dialyzed  against  a  buffer  containing  25  mM  Tris-HCl  (pH 
7.0),  300  mM  NaCl,  and  2  mM  DTT.  The  hexahistidine  tag 
was  removed  by  digestion  with  3C  protease,  and  the  protein 
was  further  purified  by  size  exclusion  chromatography  on 
Superdex  75  (GE  Healthcare).  The  BARD1  BRCT  protein 
was  concentrated  to  25  mg/mL  by  ultracentrifugation  and 
crystallized  in  20—25%  polyethylene  glycol  5000  mono¬ 
methyl  ether,  150  mM  ammonium  sulfate,  and  100  mM 
2-morpholinoethanesulfonic  acid  (pH  6.5)  at  20  °C  by  the 
sitting  drop  vapor  diffusion  method.  Crystals  were  cryopro- 
tected  in  mother  liquor  supplemented  with  16%  ethylene 
glycol  and  flash- frozen  in  a  liquid  nitrogen  stream.  BARD1 
BRCT  protein  containing  selenomethionine  (SeMet)  was 
expressed  in  B834(DE3)  cells  (Novagen)  cultured  in  minimal 
media  supplemented  with  SeMet  (40  mg/L)  and  purified  in 
a  similar  manner.  Native  and  single- wavelength  anomalous 
dispersion  (SAD)  data  were  collected  on  beamline  X12B  at 
the  National  Synchrotron  Light  Source  (NSLS),  Brookhaven 
National  Laboratory,  Upton,  NY.  The  data  were  processed 
using  HKL2000  (19).  The  crystals  belong  to  space  group 
P2\2\2\  with  unit  cell  dimensions  a  =  57.4  A,  b  =  75.8  A, 
c  =  1 16.6  A,  and  a  =  /3  =  y  =  90°.  There  are  two  molecules 
in  the  asymmetric  unit,  referred  to  as  A  and  B. 

Structure  Determination  and  Refinement.  Data  analysis, 
location  of  the  selenium  atoms,  phase  calculation,  and  density 
modification  were  carried  out  using  SHELXC,  SHELXD, 
and  SHELXE  (20),  respectively.  Phase  extension  and 
automated  model  building  were  performed  with  ARP/wARP 


Table  1:  Structure  Determination  and  Refinement  Statistics 


data  set 

native 

SAD 

wavelength  (A) 

0.9784 

0.9796 

resolution  range  (A)° 

50-1.90 

50-2.04 

(1.97-1.90) 

(2.11-2.04) 

no.  of  obsd  reflections 

309077 

304610 

no.  of  unique  reflections 

40185 

32924 

completeness  (%)a 

98.3  (91.7) 

99.2  (98.9) 

redundancy0 

7.7  (6.1) 

9.3  (8.9) 

Rsym  (%)b 

6.4  (40.2) 

8.7  (38.2) 

overall  ( I/o(l )) 

30.7  (4.3) 

19.7  (5.5) 

T^cryst  (%)c 

19.6 

Rfree  (%Y 

24.7 

Ramachandran  plot 

most  favored  (%) 

91.0 

additionally  allowed  (%) 

8.2 

generously  allowed  (%) 

0.5 

disallowed  (%) 

0.3 

bond  lengths^  (A) 

0.022 

bond  angles^  (deg) 

1.865 

a  Values  in  parentheses  are  for  the  highest  resolution  shell.  b  Rsym  = 
E|7  -  (/) | /E(7),  where  I  is  the  observed  integrated  intensity,  (7)  is  the 
average  integrated  intensity  obtained  from  multiple  measurements,  and 
the  summation  is  over  all  observed  reflections.  c  Rcryst  =  El  \Fo\  ~  k\Fc\ | / 
El E0 1,  where  F0  and  Fc  are  the  observed  and  calculated  structure  factors, 
respectively.  ^Rfree  is  calculated  as  Rcryst  using  5%  of  the  reflections 
chosen  randomly  and  omitted  from  the  refinement  calculations.  e  Bond 
lengths  and  angles  are  root-mean- square  deviations  from  ideal  values. 


(27),  in  combination  with  manual  intervention  using  COOT 
(22).  The  final  model,  refined  isotropically  with  eight  TLS 
groups  in  REFMAC5  (23),  contains  3382  protein  atoms,  238 
water  molecules,  2  sulfate  ions,  3  chloride  ions,  and  5 
ethylene  glycol  molecules.  Weak  electron  density  in  pocket 
P2  was  modeled  as  ethylene  glycol  in  molecule  A  and  as 
water  in  molecule  B.  The  N-terminal  residues  Gly  and  Pro 
that  remain  after  cleavage  with  3C  protease  are  native  to 
BARD1  (Gly568  and  Pro569).  Gly568  of  molecule  B  is 
disordered  and  was  not  modeled.  PROCHECK  (24)  was  used 
for  analysis  and  validation  of  the  refined  structure.  Glu655 
of  molecule  A  is  at  the  edge  of  the  disallowed  region  of  the 
Ramachandran  plot  (cp  =  75.7°,  xp  =  —119.8°).  The  structure 
determination  and  refinement  statistics  are  shown  in  Table  1. 

Electrostatic  Potential  Calculation.  The  program  GRASP 
(25)  was  used  to  calculate  the  electrostatic  potentials.  Charges 
were  assigned  as  follows:  Arg  N^1  and  Arg  N*72,  +0.5  e; 
Glu  Oel,  Glu  Oe2,  Asp  Odl,  and  Asp  O62,  -0.5  e;  Lys  N^, 
1.0  e.  Dielectric  constants  of  2  and  80  were  used  for  the 
interior  protein  and  outer  solvent  regions,  respectively.  A 
charge  of  +0.5  e  was  used  for  Ndl  and  N62  of  the  protonated 
His685  and  His686,  in  addition  to  the  above  charges. 

Isothermal  Titration  Calorimetry.  The  binding  capability 
of  BARD  1  BRCT  to  various  synthetic  peptides  was  measured 
using  a  VP-ITC  microcalorimeter  (MicroCal).  Briefly,  0.5 
mM  peptides  SRSTpSDDENK  and  KEESpSDDEDK  were 
titrated  against  0.05  mM  BARD1  BRCT  protein  in  phosphate- 
buffered  saline  (pH  7.4)  supplemented  with  150  mM  NaCl 
at  23  °C.  Similar  experiments  were  also  performed  in  100 
mM  Tris-HCl  (pH  7.4)  and  150  mM  NaCl  and  in  25  mM 
sodium  phosphate  (pH  6.0)  and  400  mM  NaCl  at  23  °C. 
Titration  curves  were  analyzed  using  the  program  ORIGIN 
5.0  (OriginLab). 

RESULTS  AND  DISCUSSION 

Structure  Determination.  We  expressed  the  human  BARD1 
BRCT  domains  (residues  568—777)  carrying  an  N-terminal 
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hexahistidine  tag  followed  by  a  3C  protease  cleavage  site  in 
E.  coli  BL21(DE3)  cells.  We  purified  the  soluble  fraction 
of  the  recombinant  protein  using  affinity  chromatography, 
removed  the  tag  by  digestion  with  3C  protease,  and  crystal¬ 
lized  the  native  protein  using  the  sitting  drop  vapor  diffusion 
method.  The  structure  was  determined  by  SAD,  using 
synchrotron  data  from  SeMet- substituted  crystals.  There  are 
two  molecules  in  the  asymmetric  unit,  A  and  B,  which 
display  differences  in  the  region  spanning  residues  708— 
718.  An  a-helix  in  this  area  in  molecule  B  corresponds  to 
two  short  3  io-helices  in  molecule  A.  Since  molecule  A  has 
superior  electron  density  and  its  ligand-binding  region  is 
more  solvent-exposed,  thus  resembling  more  closely  the 
physiologic  milieu,  it  will  be  used  to  describe  the  present 
structure.  The  final  model  was  refined  to  a  crystallographic 
factor  Rcryst  of  19.6%  and  an  Rfree  of  24.7%  with  data 
extending  to  1.9  A  resolution. 

Structure  of  the  BARD1  BRCT  Repeats.  The  overall 
topology  of  the  BARD1  BRCT  domains  is  similar  to 
previously  determined  BRCA1  and  MDC1  BRCT  structures. 
The  region  spanning  residues  568—777  folds  into  two  tandem 
domains,  BRCT1  (residues  568—654)  and  BRCT2  (residues 
669—777),  linked  by  a  central  a-helix  (aL)  (Figure  IB). 
BRCT1  comprises  a  central  /3-sheet  formed  by  four  parallel 
/3-strands  (/3l  —  /34)  and  flanked  by  the  a-helices  al  and  a3 
on  one  side  and  a2  on  the  other.  BRCT2  also  consists  of  a 
parallel  /3-sheet  formed  by  the  /3-strands  pY—p 4',  which  are 
neighbored  on  one  side  by  the  a-helices  al'  and  a3'. 
However,  the  a2'  helix  that  exists  between  /33'  and  /J4'  in 
other  BRCT2  structures  is  replaced  by  a  short  antiparallel 
/3-strand  (/3a')  in  BARD1  BRCT2  (Figure  IB).  Another 
unique  feature  of  the  BARD1  BRCTs  is  the  presence  of  three 
short  3io-helices  (3i0b',  3ioc',  and  3iod')  in  the  /32'— /33'  loop. 

The  two  BRCT  modules  pack  closely  against  each  other, 
burying  a  hydrophobic  interface  of  ~1450  A2  and  creating 
a  surface  groove  with  two  pockets,  Pi  and  P2.  The  hydrophilic 
Pi  is  formed  by  residues  Ser575,  Gly576,  Thr617,  and 
Fys619  (Figure  1C),  conforming  to  the  consensus  motif  (Ser/ 
Thr)-Gly...Thr-X-Fys  that  is  characteristic  of  the  pSer- 
interacting  BRCA1  and  MDC1  Pi  pockets  (Figure  1A).  A 
sulfate  ion  from  the  crystallization  solution  is  present  in  the 
Pi  pocket,  where  it  makes  direct  and  solvent-mediated 
interactions  with  BRCT  residues.  Specifically,  the  sulfate  ion 
hydrogen  bonds  with  the  main  chain  N  of  Gly576,  the  Or 
of  Ser575,  and  the  of  Fys619  and  via  a  water  molecule 
to  the  main  chain  N  of  Fys619  (Figure  1C).  These  interac¬ 
tions  are  strikingly  similar  to  those  stabilizing  the  phosphate 
group  of  the  ligand  pSer  0  in  the  BRCA1  and  MDC1 
structures  (73—77),  suggesting  that  the  BARD1  Pi  residues 
may  also  be  involved  in  similar  interactions  with  the 
phosphate  group  of  the  ligand. 

The  deeper  and  more  hydrophobic  pocket  P2  is  lined  by 
Ser616,  Met621,  His685,  His686,  and  Ile764  (Figure  ID). 
Two  prominent  features  of  this  pocket  are  His685  and 
His686,  corresponding  to  Asnl774  and  Metl775  of  BRCA1, 
respectively,  that  mediate  interactions  with  Phe  +3  of  the 
ligand  (73—76),  suggesting  that  the  BARD1  histidines  could 
play  a  key  role  in  ligand  selection.  The  electron  density  of 
the  His685  imidazole  is  weak,  indicating  that  this  side  chain 
is  flexible  (Figure  ID).  By  contrast,  the  imidazole  group  of 
His686  is  well  ordered,  and  its  position  and  geometry  at  the 
rim  of  P2  favor  the  formation  of  a  hydrogen  bond  between 


Figure  2:  Electrostatic  potential  of  the  BARD1  BRCT  surface  (A) 
at  near  neutral  pH  and  (B)  at  acidic  pH.  Pockets  Pi  and  P2  are 
encircled,  and  the  residues  that  form  them  are  indicated.  Arrows 
denote  the  change  of  the  electrostatic  potential  due  to  the  (de)- 
protonation  of  His685  and  His686.  Electrostatic  potentials  were 
calculated  with  the  program  GRASP  (25)  and  are  colored  as  red 
(acidic,  —  lOk^T),  white  (neutral,  0kBT),  and  blue  (basic,  10 kBT). 

its  N62  atom  and  the  side  chain  of  a  polar  residue  at  position 
+3  of  the  ligand.  Importantly,  calculation  of  the  electrostatic 
potential  of  the  P2  pocket  shows  that  it  is  dramatically  altered 
by  changes  in  the  protonation  state  of  His685  and  His686. 
At  near  neutral  pH,  the  solvent-exposed  Ndl  and  N62  atoms 
of  His685  and  His686  are  not  protonated,  and  the  P2  pocket 
has  a  negative  electrostatic  potential  (Figure  2A),  whereas 
at  more  acidic  pH,  these  atoms  become  protonated,  switching 
the  net  charge  to  positive  (Figure  2B).  This  raises  the 
intriguing  possibility  that  the  BARD1  BRCT  interaction  with 
its  ligand(s)  may  be  dynamically  regulated  by  the  protonation 
of  His685  and  His686  in  response  to  pH  shifts  in  local 
cellular  microenvironments  during  various  physiological  and 
pathological  conditions  (26— 28).  Notably,  such  a  “histidine 
switch”  mechanism  regulates  the  interaction  of  the  receptor- 
associated  protein  with  the  low-density  lipoprotein  receptor- 
related  protein  as  a  consequence  of  pH  changes  (29). 
Furthermore,  it  was  recently  reported  that  the  protonation 
of  a  conserved  histidine  at  the  binding  pocket  of  PSD-95 
PDZ3  controls  the  interaction  of  this  domain  with  its  target 
peptide  in  a  pH-dependent  manner  (30).  It  remains  to  be 
determined  whether  a  similar  mechanism  is  operational  in 
ligand  recognition  by  the  BARD1  BRCT  domains. 
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Figure  3:  (A)  Superposition  of  the  crystal  structures  of  the  BARD1 
BRCTs  (beige)  and  the  unbound  BRCA1  BRCTs  (light  blue)  (PDB 
entry  1JNX).  (B)  Superposition  of  the  BARD1  BRCTs  (beige)  and 
the  unbound  MDC1  BRCTs  (green)  (PDB  entry  2ADO).  The 
models  were  aligned  using  secondary  structure  matching  (35)  in 
COOT. 

Structural  Comparison  of  the  BARD1,  BRCA1,  and  MDC1 
BRCT  Repeats.  The  BARD1  and  BRCA1  BRCT  structures 
are  superimposed  well,  with  a  root-mean- square  deviation 
of  1.7  A  for  all  Ca  atoms  (Figure  3A).  However,  there  are 
distinct  structural  differences  in  these  domains  that  likely 
reflect  their  functional  differences.  The  /33~a2  loop  of 
BARD1  is  three  residues  shorter  than  its  BRCA1  counterpart, 
and  these  regions  are  not  superimposable.  The  BARD1  loop 
a3— aL  is  nine  residues  shorter  and  adopts  an  entirely 
different  conformation  from  the  corresponding  loop  of 
BRCA1.  The  pi'— /J3'  loop  of  BRCA1  is  19  residues  shorter 
than  that  of  BARD1  and  is  missing  the  three  3io-helices  of 
BARD1.  Furthermore,  the  a2'  helix  in  the  p3'—p4'  loop  of 
BRCA1  is  replaced  by  the  Pa  strand  in  BARD1. 

The  BARD1  and  MDC1  BRCT  structures  are  superim¬ 
posed  less  well,  with  a  root-mean- square  deviation  of  2.3  A 
for  all  Ca  atoms  (Figure  3B).  One  notable  difference  is  the 
presence  of  the  a-helix  aLl  in  MDC1  that  is  absent  in 
BARD1.  Like  BRCA1,  the  /?2/-y83/  loop  of  MDC1  is  21 
residues  shorter  than  that  of  BARD1  and  is  missing  the  three 
3  io-helices  of  BARD  1,  whereas  the  BARD1  a3— aL  loop  is 
10  residues  shorter  than  that  of  MDC1.  The  aL— PY  loop 
of  BARD1  is  2  residues  longer  than  the  corresponding  loop 
of  MDC1,  resulting  in  poor  superposition  of  these  regions. 
Another  difference  is  the  presence  of  the  elongated  and  bent 
a.2'  helix  in  the  /J3'— /J4'  loop  of  MDC1  that  is  replaced  by 
the  short  Pa  strand  in  BARD1. 

Ligand  Selection  hy  the  BARD1  BRCTs.  To  study  the 
ligand-binding  properties  of  the  BARD1  BRCTs,  we  used 
isothermal  titration  calorimetry  to  measure  the  affinity  of 
these  domains  for  the  synthetic  peptides  SRSTpSDDENK 
and  KEESpSDDEDK,  which  contain  the  consensus  motif 


Figure  4:  (A)  Mapping  of  the  cancer-associated  missense  muta¬ 
tions  C645R,  V695L,  and  S761N  on  the  BARD1  BRCT  structure. 
The  side  chains  of  Arg645  and  Leu695  that  are  predicted  to 
destabilize  the  BRCT  fold  are  shown  as  green  space-filled  models, 
whereas  the  surface-exposed  side  chain  of  Asn761  is  colored  red. 
(B)  Proposed  mechanism  for  the  abrogation  of  the  BARD1  BRCT 
function  by  the  S761N  mutant.  The  Asn761  O51  is  in  close 
proximity  to  His686  Ne2  (2.7  A)  to  form  a  hydrogen  bond  (green 
dotted  line)  that  locks  the  imidazole  ring  of  His686  in  a  conforma¬ 
tion  that  cannot  interact  with  the  side  chain  of  a  residue  at  position 
+3  of  the  ligand.  The  figure  was  made  using  BOBSCRIPT  and 
POV-Ray  (www.povray.org). 

(shown  in  bold)  previously  reported  to  be  recognized  by  these 
domains  in  oriented  library  screens  (72).  Surprisingly,  no 
measurable  binding  was  detected  with  either  of  these  peptides 
at  pH  6.0  and  7.4  and  under  experimental  conditions  identical 
to  those  reported  (72).  Furthermore,  cocrystallization  of  these 
peptides  with  the  BARD1  BRCT  domains  yielded  crystals 
of  unbound  BARD1  protein  (data  not  shown).  These  results 
contradict  the  previously  reported  BARD1  BRCT  selectivity 
(72)  and  could  not  be  easily  attributed  to  different  experi¬ 
mental  conditions.  In  this  context,  it  is  important  to  note  that 
the  observed  BARD1  BRCT  specificities  for  the  ligand 
residues  at  selectivity  positions  0  to  +3  were  relatively  low 
(72),  suggesting  that  the  binding  is  very  weak  and  perhaps 
transient.  It  is  therefore  possible  that  additional  interactions 
between  residues  surrounding  the  ligand  consensus  motif  and 
amino  acids  outside  the  Pi  and  P2  pockets  are  required  for 
high-affinity  binding.  Alternatively,  the  BARD1  BRCT 
repeats  may  recognize  macromolecular  ligands  with  entirely 
different  sequence  motifs  and  structural  characteristics.  As 
part  of  our  attempts  to  identify  ligands  for  BARD1,  we  also 
tested  the  BACH1  peptide  ISRSTpSPTFNKQ,  which  is  an 
avid  ligand  for  the  BRCA1  BRCTs,  and  the  doubly  phos- 
phorylated  p53  peptide  EPPLpSQEpTFS  for  binding  to  the 
BARD1  BRCT  domains  in  similar  isothermal  titration 
calorimetry  experiments.  However,  as  expected  from  the 
nature  of  the  BARD1  P2  pocket,  which  is  missing  critical 
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hydrophobic  residues  that  make  contacts  with  the  phenyl  ring 
of  Phe  +3  in  the  BRCA1— ligand  structures  (13—16),  no 
binding  was  observed  with  these  peptides  (data  not  shown). 
Therefore,  the  analysis  of  the  structural  determinants  under¬ 
lying  ligand  selection  by  the  BARD1  BRCTs  will  await  the 
identification  of  bona  fide  ligands  for  these  domains. 

Mapping  of  Cancer-Associated  Missense  Mutations.  To 
obtain  insights  into  the  mechanisms  underlying  the  disruption 
of  BARD1  function  by  the  C645R,  V695L,  and  S761N 
mutations,  we  mapped  the  substituted  residues  on  the 
BARD1  BRCT  structure  (Figure  4A).  Cys645  is  positioned 
in  the  loop  between  the  a3  and  3ioa  helices.  Substitution  of 
this  residue  with  a  bulkier  and  charged  arginine  could  disrupt 
the  hydrophobic  interface  between  fil  and  al  formed  by 
Leu570,  Leu572,  Leu592,  and  Ala594  and/or  expose  this 
interface  by  breaking  hydrogen  bonds  between  Glu648  O62 
and  His606  Ne2  on  /33  and  between  Trp635  Nel  and  the 
carbonyl  oxygen  of  Gln646,  thereby  destabilizing  the  BRCT1 
fold.  Val695  is  part  of  the  al'  helix  facing  the  /3-sheet  of 
BRCT2,  and  replacement  of  its  isopropyl  group  by  the 
bulkier  isobutyl  side  chain  of  leucine  could  have  a  destabiliz¬ 
ing  effect  on  the  BRCT2  fold.  By  contrast,  Ser761  is  surface- 
exposed  and  lies  at  the  N-terminal  end  of  a3'  in  the  vicinity 
of  the  BARD1  P2  pocket.  This  residue  corresponds  to 
Glul836  in  the  BRCA1— CtIP  structure,  which  makes  water- 
mediated  hydrogen  bonds  with  the  carbonyl  oxygen  of  Gly 
+4  of  the  peptide  and  the  Argl699  Nv2  of  BRCA1  (16). 
Thus,  it  is  possible  that,  in  the  S761N  mutant,  a  hydrogen 
bond  is  formed  between  the  Asn761  O'51  and  His686  N62, 
effectively  locking  the  His686  imidazole  ring  in  an  unfavor¬ 
able  conformation  to  interact  with  the  side  chain  of  a  residue 
at  position  +3  (Figure  4B),  thereby  reducing  the  BARD1 
affinity  for  its  binding  partner. 

Perspective.  The  crystal  structure  of  the  BARD1  BRCT 
domains  reveals  the  atomic  features  of  the  pockets  Pi  and 
P2  and  provides  an  essential  basis  for  the  elucidation  of  the 
mechanisms  underlying  ligand  recognition  by  these  modules 
during  cell  cycle  control,  DNA  repair,  and  apoptosis.  In  light 
of  our  binding  results,  the  previously  reported  selectivity  of 
the  BARD1  BRCTs  for  peptides  having  a  glutamate  at 
position  +3  (72)  needs  to  be  independently  verified  and/or 
reevaluated  by  future  studies  aimed  at  dissecting  the  ligand 
specificity  of  these  modules.  When  the  physiologic  binding 
partners  of  the  BARD1  BRCTs  are  identified,  it  will  be 
possible  to  study  the  role  of  His685  and  His686  in  ligand 
recognition  and  analyze  the  structural  consequences  of  the 
cancer-associated  missense  mutations  on  the  binding  proper¬ 
ties  of  these  domains.  Perhaps  most  importantly,  the  three- 
dimensional  ligand-binding  interface  of  the  BARD1  BRCT 
repeats  provides  a  structural  framework  for  the  design  of 
small  molecules  that  could  specifically  regulate  the  BARD1 
function  by  modulating  the  BRCT  affinities  for  their  targets, 
with  potential  clinical  applications. 
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The  neuronal  protein  FE65  functions  in  brain  development  and  amyloid 
precursor  protein  (APP)  signaling  through  its  interaction  with  the 
mammalian  enabled  (Mena)  protein  and  APP,  respectively.  The  recognition 
of  short  polyproline  sequences  in  Mena  by  the  FE65  WW  domain  has  a 
central  role  in  axon  guidance  and  neuronal  positioning  in  the  developing 
brain.  We  have  determined  the  crystal  structures  of  the  human  FE65  WW 
domain  (residues  253-289)  in  the  apo  form  and  bound  to  the  peptides 
PPPPPPLPP  and  PPPPPPPPPL,  which  correspond  to  human  Mena  residues 
313-321  and  347-356,  respectively.  The  FE65  WW  domain  contains  two 
parallel  ligand-binding  grooves,  XP  (formed  by  residues  Y269  and  W280) 
and  XP2  (formed  by  Y269  and  W271).  Both  Mena  peptides  adopt  a 
polyproline  helical  II  conformation  and  bind  to  the  WW  domain  in  a 
forward  (N-C)  orientation  through  selection  of  the  PPPPP  motif  by  the  XP 
and  XP2  grooves.  This  mode  of  ligand  recognition  is  strikingly  similar  to 
polyproline  interaction  with  SH3  domains.  Importantly,  comparison  of  the 
FE65  WW  structures  in  the  apo  and  liganded  forms  shows  that  the  XP2 
groove  is  formed  by  an  induced-fit  mechanism  that  involves  movements  of 
the  W271  and  Y269  side-chains  upon  ligand  binding.  These  structures 
elucidate  the  molecular  determinants  underlying  polyproline  ligand 
selection  by  the  FE65  WW  domain  and  provide  a  framework  for  the  design 
of  small  molecules  that  would  interfere  with  FE65  WW-ligand  interaction 
and  modulate  neuronal  development  and  APP  signaling. 

©  2007  Elsevier  Ltd.  All  rights  reserved. 

Keywords:  FE65;  WW  domain;  Mena;  polyproline  helix  II;  crystal  structure 


Introduction 

FE65  is  expressed  predominantly  in  the  brain, 
where  it  plays  critical  roles  in  cortical  development 
and  APP  signaling.1  FE65  possesses  a  WW  domain 
(FE65  WW)  and  two  phosphotyrosine-binding 
(PTB)  domains  with  distinct  binding  specificities. 
The  FE65  WW  recognizes  polyproline  sequences 
present  in  several  proteins,  including  the  c-Abl 
tyrosine  kinase,2  the  mammalian  homolog  of  the 
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Drosophila  protein  enabled  (Mena),3  and  the  neuro¬ 
nal  P2X2  receptor.4  The  FE65  PTB1  binds  to  the 
histone  acetyltransferase  TIP60,5  the  transcription 
factor  CP2,6  and  the  cytoplasmic  domains  of  the 
low-density  lipoprotein  receptor-related  protein7 
and  ApoER2  receptor,8  whereas  PTB2  interacts 
with  the  APP  intracellular  domain.9  The  multi¬ 
protein  complexes  resulting  from  FE65-mediated 
interactions  underlie  the  function  of  this  protein  in 
regulation  of  neuronal  growth  cone  motility,10'11 
ATP-mediated  synaptic  transmission,4  learning  and 
memory,12  APP  processing,13-15  and  transcriptional 
regulation  of  gene  expression.5'16-22  FE65  WW  is 
essential  for  many  of  these  functions.  In  particular, 
the  FE65  WW-mediated  interaction  with  Mena  plays 
a  central  role  in  axonal  growth  cone  dynamics,  axon 
guidance,  and  neuronal  positioning  in  the  develop¬ 
ing  brain.23-27 

WW  domains,  named  for  two  conserved  trypto¬ 
phan  residues  (Figure  1(a)),  are  highly  compact 
protein-protein  interaction  modules  that  fold  into 
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(a) 


Group  II/III  253  259  269  271  280  289 

FE65  SDLPAGjjMRVQD  -  TSG  -  THYWH I  PTGTTQgEPPGRASPS 

fbpi i  pkqksmJtehks - pdgrtByyntetkqstJJekpddlktp 

FBP28  ATAVSEjTEYKT-ADGKTBYYNNRTLESTSEKPQELKEK 

prp40  msiBkeakd-asgribyyntltkkstBekpkelisq 

Group  I 

Dystrophin  TSVQGPnERAIS-PNKVPHYINHETQTTCflDHPKMTELY 

yap65  vplpagHemakt-ssgqrBflnhidqtttBqdprkamls 

Nedd4  pplppgSeekvd-nlgrtbyvnhnnrstqShrpslmdvs 

Group  IV 

Pinl  eklppgEekrmsrssgrvByfnhitnasqHerpsgnsss 


Figure  1.  (a)  Sequence  comparison  of  representative  WW  domains.  The  sequences  of  the  single  WW  of  FE65, 
dystrophin,  and  Pinl,  the  first  WW  of  FBP11,  PRP40,  YAP65,  and  Nedd4,  and  the  second  WW  of  FBP28,  were  aligned 
using  the  program  CLUSTAL  W.56  Hyphens  represent  gaps  inserted  for  optimum  alignment.  The  secondary  structure 
elements  of  FE65  WW  are  indicated  at  the  top.  Residues  of  human  FE65  are  numbered.  Two  conserved  tryptophan 
residues  (after  which  the  domain  is  named)  are  shown  in  white  on  a  blue  background.  Invariant  tyrosine  residues  (shown 
in  white  on  a  red  background)  and  bulky  hydrophobic  residues  (highlighted  in  yellow)  form  the  XP2  groove  in  group 
II/III  domains.  Conserved  threonine /serine  residues  that  hydrogen  bond  with  the  ligand  are  highlighted  in  green,  (b) 
Ribbon  diagram  of  the  FE65  WW  domain.  Residues  that  form  the  XP  and  XP2  groves  are  shown  as  yellow  stick  models. 
The  side-chains  of  residues  forming  a  hydrophobic  core  that  stabilizes  the  fold  are  shown  in  pink,  (c)  A  stereo  view  of 
eight  FE65  WW  molecules  present  in  the  asymmetric  unit.  Superposition  of  these  independent  structures  shows  that  they 
fall  into  two  distinct  conformations  highlighted  in  shades  of  green  (apo  form)  and  pink  (bound  to  a  PEG400  molecule). 
Note  the  conformational  changes  of  the  W271  and  Y269  side-chains  in  the  bound  form,  (d)  In  the  apo  form,  the  XP2  groove 
is  shallow,  (e)  Binding  of  a  PEG400  molecule  induces  the  formation  of  a  deep  XP2  groove,  primarily  through 
conformational  changes  of  the  indole  side-chain  of  W271  and,  to  a  lesser  degree,  of  the  phenyl  ring  of  Y269.  The  Figure 
was  made  using  PyMol  [www.pymol.org]. 


stable,  slightly  curved,  three-stranded  antiparallel 
p-sheet  structures.28-31  These  domains  interact  with 
short  proline-rich  sequences  and  are  found  in  pro¬ 


teins  of  diverse  functions,  including  regulation  of 
transcription,  RNA  processing,  ubiquitin  ligation, 
protein  trafficking,  receptor  signaling,  and  control  of 
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the  cytoskeleton. 28,32  WW  domains  were  initially 
categorized  into  four  groups  on  the  basis  of  their 
ligand-binding  specificity.  Group  I  binds  polypep¬ 
tides  with  the  core  consensus  PPXY  (PY  motif), 
where  X  can  be  any  residue;  group  II  recognizes  the 
sequence  PPLP  (PL  motif);  group  III  selects  for 
proline-rich  sequences  with  arginine  residues  (PR 
motif);  and  group  IV  interacts  with  phospho(serine/ 
threonine)-proline  containing  peptides  (pS/pTP 
motif).28  The  concave  surface  of  the  WW  domain 
contains  a  series  of  exposed,  nearly  parallel  aromatic 
residues,  forming  a  hydrophobic  cavity  that  receives 
the  polyproline  peptide.  Group  I  and  IV  domains 
have  a  single  groove  formed  by  two  highly 
conserved  aromatic  residues  (typically  tyrosine  in 
the  |3>2  strand  and  tryptophan  in  the  |2>3  strand), 
which  receives  the  XP  residues  of  the  peptide  (XP 
groove).28,29  By  contrast,  group  II  and  III  WW 
domains  have  an  additional  groove,  XP2,  which  is 
formed  by  the  conserved  tyrosine  and  an  aromatic 
residue  in  the  |2>2  strand.33'34  In  this  respect,  groups 
II  and  III  are  more  similar  to  SH3  domains,  which 
have  two  successive  XP  grooves,  suggesting  that 
these  structurally  and  evolutionarily  unrelated 
protein  modules  have  converged  on  a  similar 
mechanism  for  polyproline  recognition.  On  the 
basis  of  the  distinct  structural  features  and  similar 
binding  specificities  of  group  II  and  III  WW 
domains,  including  recognition  of  uninterrupted 
polyproline  sequences  (PPPPP)  in  addition  to  PL 
and  PR  motifs,  it  was  proposed  recently  that  these 
groups  be  merged  into  a  larger  class  II /III. 35 

Structural  studies  of  WW  domains  bound  to  their 
cognate  ligands,  including  the  crystal  structures  of 
dystrophin36  (group  I)  and  Pinl37  (group  IV),  and 
the  solution  structures  of  YAP65,38  Nedd4,39,40  and 
Smurf241  (group  I),  and  FBP1133,34  (group  II/III) 
WW-peptide  complexes,  provided  insights  into  the 
molecular  mechanisms  underlying  ligand  selectivity 
by  these  modules.  The  peptides  adopt  a  polyproline 
II  (PPII)  helical  conformation  with  core  proline 
residues  packing  against  the  XP  and  XP2  grooves, 
and  carbonyl  oxygen  atoms  hydrogen  bonding  with 
WW  residues.  Interactions  between  residues  outside 
the  ligand  core  motif  and  variable  loops  of  the  WW 
domain,  as  well  as  adjacent  domains,  contribute  to 
the  specificity  and  affinity.29,30,41  Importantly,  WW 
domains  can  bind  to  ligands  in  opposite  orientations, 
as  was  first  observed  with  SH3  domains.42,43  For 
example,  the  Pinl  WW  binds  to  the  ligand  in  an  N-C 
or  +  orientation,37  whereas  the  dystrophin  WW 
interacts  with  the  peptide  in  a  C-N  or  -  orientation,36 
because  of  a  twofold  rotational  pseudosymmetry  in 
the  shapes  and  hydrogen  bonding  networks  of  the 
WW  grooves  and  PPII  helices.29 

Insights  into  the  architecture  of  the  XP  and  XP2 
grooves  of  group  II/III  domains  were  obtained  from 
the  solution  structure  of  FBP11  WW1  bound  to  a 
peptide  having  the  core  sequence  PPLP.33,34  A 
prominent  feature  of  this  structure  is  the  non-parallel 
arrangement  of  the  XP2  and  XP  grooves,  with  the 
former  being  occupied  by  the  pair  PP  and  the  latter 
receiving  the  residues  LP.  However,  because  this  is 


currently  the  only  structure  of  a  group  II/III  WW 
domain,  it  is  not  clear  whether  the  observed 
orientation  of  the  XP  and  XP2  grooves  is  a  common 
feature  of  all  members  of  this  group.  The  FE65  WW 
in  particular,  is  distinct  from  other  group  II/III 
domains  because  it  has  a  tryptophan  (W271)  instead 
of  tyrosine  in  the  (3»2  strand  that  forms  the  XP2  groove 
(Figure  1(a)),  suggesting  that  the  size  and  orientation 
of  this  groove  might  be  different.  Therefore,  struc¬ 
tures  of  the  FE65  WW  and  other  group  II/III  WW 
domains  bound  to  their  ligands  will  advance  our 
understanding  of  the  molecular  basis  underlying 
ligand  recognition  by  these  protein  modules. 

Here,  we  report  the  atomic  structures  of  FE65  WW 
in  the  apo  form  and  bound  to  two  polyproline 
peptides  from  human  Mena,  which  represent  the 
first  high-resolution  crystallographic  analysis  of  a 
group  II/III  WW  domain.  The  FE65  WW  contains 
two  parallel  ligand-binding  grooves,  XP  and  XP2, 
the  latter  being  formed  by  an  induced-fit  mechan¬ 
ism.  These  grooves  select  for  the  PPPPP  motif  of  the 
Mena  peptides  using  a  mechanism  strikingly  similar 
to  that  of  polyproline  recognition  by  SH3  domains. 
These  structures  will  facilitate  the  design  of  small 
molecules  that  could  modulate  the  function  of  FE65, 
with  possible  clinical  applications. 

Results  and  Discussion 

Structure  determination 

We  expressed  a  protein  fragment  of  human  FE65 
(residues  253-289)  that  encompasses  the  WW 
domain,  as  a  glutathione-S-transferase  (GST)  fusion 
in  Escherichia  coli  cells,  purified  it  by  affinity 
chromatography,  and  released  the  FE65  WW  from 
the  GST  moiety  by  digestion  with  the  tobacco  etch 
virus  (TEV)  protease.  The  FE65  WW  protein  was 
further  purified  by  size-exclusion  chromatography. 
The  purified  protein  formed  multimeric  complexes 
in  solution,  as  judged  by  size-exclusion  chromato¬ 
graphy  and  dynamic  light-scattering  experiments 
(data  not  shown),  and  it  was  crystallized  by  the 
sitting-drop,  vapor-diffusion  method.  Initial  at¬ 
tempts  to  determine  the  crystal  structure  of  the 
apo  FE65  WW  by  molecular  replacement  using 
other  WW  structures  as  search  models  failed  (data 
not  shown).  The  structure  was  determined  by 
single-wavelength  anomalous  dispersion  (SAD) 
methods  of  selenomethionine  (SeMet)-substituted 
crystals.  The  structure  was  refined  to  2.28  A  resolu¬ 
tion  with  a  crystallographic  factor  Rc ryst  of  21.6% 
and  an  Rfree  of  28.2%  (Table  1).  Interestingly,  there 
are  eight  molecules  (A-H)  in  the  asymmetric  unit 
that  adopt  two  distinct  conformations  (see  below). 
We  also  crystallized  FE65  WW  bound  to  the  peptide 
MnlO  (PPPPPPPPPL),  corresponding  to  residues 
347-356  of  human  Mena,  and  determined  the 
structure  by  multiwavelength  anomalous  dispersion 
(MAD)  methods,  using  crystals  of  MnlO  bound  to 
SeMet-substituted  FE65  WW.  The  crystal  structure 
was  refined  to  1.33  A  resolution  with  an  Rcryst  of 
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Table  1.  Statistics  of  structure  determination  and  refinement 


WW  Apoa 

WW-Mnl0b 

WW-Mn9b 

A.  Data  collection  and  phasing 

Data  set 

Native 

SAD  XI 

Native 

MAD  XI 

MAD  X2 

MAD  X3 

Native 

Wavelength  (A) 

0.975 

0.9789 

0.985 

0.9792 

0.9793 

0.960 

1.000 

Resolution  (A) 

50-2.19 

50-2.53 

50-1.33 

50-1.58 

50-1.58 

50-1.50 

50-1.27 

Unique  reflections 

20584 

13311 

9129 

5591 

5595 

6494 

14379 

Completeness  (%)c 

99.4(98.6) 

98.9(89.2) 

98.6(90.3) 

99.9(100) 

99.9(100) 

99.3(93.2) 

97.6(93.5) 

Redundancy  (%) 

9.2(8.2) 

36.0(27.1) 

6.5(5.4) 

20.5(19.6) 

19.9(19.1) 

12.8(9.6) 

7.5(5.4) 

RsyJ  (%) 

4.5(32.3) 

8.5(42.7) 

5.7(44.0) 

7.1(31.3) 

6.3(32.8) 

6.5(43.8) 

4.4(38.6) 

<I>  /  <a(f)> 

43.3(5.3) 

60.7(7.8) 

30.1(3.8) 

53.3(12.9) 

52.6(12.1) 

41.3(5.3) 

27.5(3.7) 

B.  Refinement 

Resolution  range  (A) 

37.3-2.28 

17.03-1.33 

25.8-1.35 

Reflections  in  working  set 

17,415 

8654 

11,423 

Reflections  in  test  set 

924 

431 

586 

Fcryst6  (%) 

Rfree  (%) 

21.6 

17.2 

19.2 

28.2 

21.4 

22.4 

Average  isotropic  B  factors 

Protein  (A2) 

Peptide  (A2) 

47.2 

18.9 

23.6 

20.3 

29.6 

Ramachandran  plot 

Allowed  regions  (%) 

87.8 

91.3 

92.9 

Additionally  allowed  regions  (%) 

12.2 

8.7 

3.6 

Generously  allowed  regions  (%) 

0.0 

0.0 

3.6 

Disallowed  regions  (%) 

0.0 

0.0 

0.0 

Number  of  water  molecules 

119 

34 

36 

a  Crystals  of  Fe65  WW  in  the  apo  form  were  analyzed  at  NSLS  beamline  X12C. 

b  Crystals  of  the  Fe65  WW-MnlO  and  Fe65  WW-Mn9  complexes  were  analyzed  at  NSLS  beamline  X12B. 

c  The  numbers  in  parentheses  correspond  to  the  highest  resolution  shell  (2.27-2.19  A  for  WW  Apo  Native,  2.62-2.53  A  for  SAD, 
1.38-1.33  A  for  WW-MnlO  Native,  1.64-1.58  A  for  MAD  XI  and  \2,  1.55-1.50  A  for  MAD  \3,  and  1.32-1.27  A  for  WW-Mn9  Native). 

d  RSym  =  Z  I  (I-  </>)  I  /ZCD/  where  I  is  the  observed  integrated  intensity,  <I>  is  the  average  integrated  intensity  obtained  from  multiple 
measurements,  and  the  summation  is  over  all  observed  reflections. 
e  l^cryst  =  Z|  l^obs  |  -  k  |  FCaicll/Z  I  Fobs  I  /  where  Fobs  and  Fcaic  are  the  observed  and  calculated  structure  factors,  respectively. 
f  Rfree  is  calculated  as  Rcryst  using  5%  of  the  reflection  data  chosen  randomly  and  omitted  from  the  refinement  calculations. 


17.2%  and  an  Rfree  of  21.4%.  Using  this  structure  as  a 
search  model,  we  also  solved  by  molecular  replace¬ 
ment  the  crystal  structure  of  the  FE65  WW  bound  to 
the  peptide  Mn9  (PPPPPPLPP)  spanning  Mena 
residues  313-321,  and  refined  it  to  1.35  A  resolution 
with  an  Rcryst  of  19.2%  and  an  Pfree  of  22.4%.  Both 
FE65  WW-Mn9  and  FE65  WW-MnlO  crystals  have 
one  molecule  in  the  asymmetric  unit. 

Crystal  structure  of  FE65  WW  in  the  apo  form 

FE65  WW  adopts  the  typical  WW  fold,  comprising 
an  antiparallel  |2> -sheet  formed  by  three  (3 -strands 
((3l-|3>3)  and  two  short  loops  (LI  and  L2)  (Figure  1(a) 
and  (b)).  At  the  convex  surface  of  the  |3-sheet  the 
conserved  aromatic  residues  Y270  and  W259  stack 
against  P282  and  the  invariant  P283,  respectively, 
forming  a  hydrophobic  cluster  that  stabilizes  the 
fold  (Figure  1(b)).  This  hydrophobic  core  is  shielded 
from  the  solvent  by  L255  and  P256,  and  it  brings 
together  the  N  and  C  termini,  generating  a  domain 
that  can  be  transferred  as  a  functional  unit  between 
proteins.  The  concave  ligand-binding  surface  con¬ 
tains  two  distinct  grooves,  XP  and  XP2.  The 
canonical  XP  groove  is  formed  by  Y269  and  W280, 
and  its  size  is  determined  by  the  LI  loop,  which  is 
essentially  a  short  (3-turn,  and  T278  in  the  (33  strand. 
The  XP2  groove  is  formed  primarily  by  Y269,  W271, 
and  V262.  In  loop  L2,  H272  stacks  against  P274 
exposing  the  hydrophobic  side-chain  of  1273,  which 


in  turn  restricts  the  movement  of  the  W271  indole 
ring  and  indirectly  influences  the  XP2  groove. 
Because  FE65  WW  has  only  20  residues  between 
the  conserved  W259  and  W280  (Figure  1(a)),  its 
H272  has  shifted  towards  the  N  terminus  by  one 
residue  compared  to  other  WW  domains  and,  as  a 
result,  its  imidazole  ring  lies  below  the  (3 -sheet.  By 
contrast,  the  corresponding  histidine  in  group  I  WW 
domains  lies  above  the  (3-sheet  and  participates  in 
ligand  recognition  by  hydrogen  bonding  with  the 
tyrosine  of  the  PY  motif. 36/38-41 

Formation  of  the  FE65  WW  XP2  groove  by  an 
induced-fit  mechanism 

Superimposition  of  the  eight  molecules  present  in 
the  asymmetric  unit  shows  that  they  are  grouped 
into  two  distinct  conformations  (Figure  1(c)).  Mole¬ 
cules  B,  C,  E,  and  H  adopt  a  conformation  in  which 
the  XP2  groove  is  unoccupied  and  the  side-chains  of 
W271  and  Y269  are  approaching  each  other, 
effectively  closing  this  groove  (Figure  1(c)  and  (d)). 
By  contrast,  molecules  A,  D,  F  and  G  have  a  PEG400 
molecule  (from  the  crystallization  solution)  bound 
to  the  XP2  groove,  resulting  in  a  rotation  of  the 
W271  indole  ring  plane  by  an  average  of  60°  relative 
to  the  unbound  one  (Figure  1(c)  and  (e))o.  The  phenyl 
ring  of  Y269  is  also  displaced  by  ~1.5  A,  effectively 
increasing  the  width  of  the  XP2  groove  to  accom¬ 
modate  the  ligand.  Sequence  alignment  shows  that 
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W271  corresponds  to  smaller  aromatic  residues  (Tyr 
or  Phe)  in  other  group  II /III  and  IV  domains  and 
hydrophobic  residues  in  group  I  domains  (Figure 
1(a)).  Notably,  mutation  of  the  corresponding 
leucine  (L190)  to  tryptophan  in  the  YAP65  WW1 
domain  shifts  the  specificity  of  this  domain  from 
group  I  to  that  of  group  II.44  The  present  crystal 
structure  provides  an  explanation  for  this  switch 
and  elucidates  the  molecular  basis  for  the  important 
role  of  the  W271  indole  ring  in  the  formation  of  the 
XP2  groove  by  an  induced-fit  mechanism. 

Structural  determinants  of  polyproline 
recognition  by  FE65  WW 

It  was  shown  that  the  human  Mena  polyproline 
sequences  spanning  residues  313-321  and  347-356 
are  ligands  for  the  FE65  WW.3  To  evaluate  the 
contribution  of  these  motifs  to  the  Mena  interaction 
with  FE65  WW,  we  synthesized  the  peptides  Mn9 
and  MnlO  that  correspond  to  these  regions  and  de¬ 
termined  their  binding  affinities  for  FE65  WW  using 
isothermal  titration  calorimetry.  FE65  WW  binds  to 
Mn9  and  MnlO  peptides  with  dissociation  constants 
124(±9.5)  pM  and  116(±6.5)  jiM,  respectively  (Figure 
2).  The  micromolar  affinities  are  in  agreement  with 
previously  measured  affinities  for  other  WW-ligand 
complexes,45  and  are  thought  to  underlie  the  for¬ 
mation  of  transient  complexes  by  these  modules 
during  dynamic  cellular  processes. 

The  structural  basis  for  the  specificity  and  affinity 
of  the  FE65  WW-MnlO  interaction  was  elucidated 
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by  the  crystal  structure  of  this  complex  (Figure  3(a) 
and  (b)).  The  peptide  adopts  a  PPII  helix,  and  each  of 
the  XP  and  XP2  grooves  is  occupied  by  two  proline 
residues:  P5T6'  insert  into  XP2  and  P8T9'  occupy 
XP  (peptide  residues  denoted  with  primes).  The 
residues  Pl'-P4'  extend  beyond  the  WW  molecule 
(Figure  3(a)),  with  P1'P2'  contacting  loop  L2  of  an 
adjacent  crystallographic  molecule.  Notably,  the 
side-chain  of  LI  O'  faces  toward  the  solvent  and 
does  not  make  any  contact  with  the  protein  (Figure 
3(a)).  The  XP2  groove  is  in  the  open  conformation,  as 
observed  in  the  PEG400-bound  FE65  WW.  Indeed, 
the  W271  and  Y269  side-chains  of  FE65  WW  bound 
to  MnlO  superimpose  almost  exactly  with  the 
corresponding  side-chains  of  the  PEG400-bound 
FE65  WW,  indicating  that  an  induced-fit  mechanism 
underlies  polyproline  ligand  recognition  by  FE65 
WW.  Importantly,  the  separation  of  the  two  grooves 
by  Y269  and  T278  necessitates  the  presence  of  a 
spacer  residue  between  the  two  pairs  of  proline 
residues  that  occupy  the  grooves.  Although  in  the 
FE65  WW-MnlO  complex  the  spacer  is  proline  (P7r), 
this  position  could  be  occupied  by  any  amino  acid 
(X)  without  breaking  the  PPII  conformation,  indicat¬ 
ing  that  the  sequence  PPXPP  is  the  general  recogni¬ 
tion  motif  of  FE65  WW. 

Because  the  MnlO  peptide  does  not  contain  an 
intact  PPLP  motif,  reported  to  be  recognized  by 
FE65  WW,3  we  also  determined  the  crystal  structure 
of  FE65  WW  bound  to  the  peptide  Mn9  that  contains 
this  sequence.  Surprisingly,  the  XP  and  XP2  grooves 
of  FE65  WW  also  bind  to  the  PPPPP  and  not  to  the 
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Figure  2.  Isothermal  titration  calorimetry  results  obtained  for  the  FE65  WW  interaction  with  (a)  peptide  MnlO  and  (b) 
peptide  Mn9,  as  described  in  Experimental  Procedures. 
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Figure  3.  (a)  A  ribbon  diagram  of  the  FE65  WW  bound  to  peptide  MnlO,  shown  as  a  stick  model.  WW  residues  critical 
for  interaction  with  the  peptide  are  shown  as  yellow  stick  models.  N  and  O  atoms  are  colored  blue  and  red,  respectively. 
Hydrogen  bonds  are  denoted  by  broken  lines,  (b)  A  weighted  2Fobs-Fcalc  electron  density  map  of  the  FE65  WW-MnlO 
structure  calculated  at  1.33  A  and  contoured  at  2ct.  (c)  A  ribbon  diagram  of  the  FE65  WW  bound  to  peptide  Mn9.  Atoms 
are  colored  as  in  (a).  A  water  molecule  is  indicated  by  a  cyan  sphere,  (d)  Superposition  of  the  FE65  WW  (green)  bound  to 
MnlO  (yellow)  with  the  FE65  WW  (cyan)  bund  to  Mn9  (orange)  structures.  For  clarity,  only  the  parts  of  the  ligands  that 
interact  with  the  XP  and  XP2-forming  aromatic  residues  of  FE65  WW  are  shown.  The  Figure  was  made  using 
BOBSCRIPT57  and  POV-Ray  (www.povray.org).  (e)  and  (f)  Two-dimensional  representation  of  the  FE65  WW  interactions 
with  the  MnlO  and  Mn9  peptides,  respectively  (made  using  LIGPLOT58).  FE65  and  Mena  residues  are  colored  orange  and 
purple,  respectively.  Hydrogen  bonds  are  shown  as  broken  lines,  hydrophobic  interactions  as  arcs  with  radial  spokes,  and 
a  water  molecule  as  a  cyan  sphere. 
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PPLP  motif  (Figure  3(c)).  Like  MnlO,  Mn9  forms  a 
PPII  helix,  with  P1'P2'  residues  inserting  into  XP2 
and  P4'P5'  occupying  the  XP  groove,  further  cor¬ 
roborating  the  notion  that  the  FE65  grooves  have  a 
stronger  preference  for  proline  rather  than  leucine.  It 
appears  that  the  small  size  of  the  FE65  XP  groove, 
which  is  determined  by  the  short  loop  LI,  plays  an 
important  role  in  the  selection  of  the  pyrrolidine  ring 
of  proline  over  the  bulkier  isobutyl  group  of  a 
leucine  residue.  By  contrast,  the  longer  LI  loop  of 
FBP11  WW1  generates  a  larger  XP  groove  that  can 
accommodate  a  leucine  residue.  Nevertheless,  in  the 
FE65  WW-Mn9  structure  the  side-chain  of  L7' 
clamps  the  indole  group  of  W280  outside  the  XP 
groove,  providing  additional  stabilization  to  the 
complex  (Figure  3(c)). 

Superposition  of  the  FE65  WW-MnlO  and  FE65 
WW-Mn9  complexes  shows  that  the  overall  struc¬ 
tures  are  very  similar  (root-mean-square  deviation 
0.61  A  for  all  Ca  atoms  of  the  WW  region  spanning 
L255-P283  and  the  atoms  of  the  five  proline  residues 
that  occupy  the  two  grooves).  However,  it  is  noted 
that  the  MnlO  and  Mn9  proline  residues  occupying 
the  XP  groove  are  not  superimposed  as  well  as  those 
inserting  into  the  XP2  pocket  (Figure  3(d)),  likely 
reflecting  the  different  crystal  packing  arrangements 
of  the  two  complexes  due  to  differences  in  the 
crystallization  conditions  and/or  peptide  composi¬ 
tion.  Notably,  the  C-terminal  region  of  the  Mn9 
peptide  extends  beyond  the  WW  domain  and  the 
side-chain  of  L7'  makes  hydrophobic  contacts  with 
adjacent  crystallographic  molecules,  resulting  in  a 
more  tightly  coiled  PPII  helix.  Notwithstanding  the 
possible  crystal  lattice  effects  that  may  influence  the 
position  of  the  peptide  inside  the  XP  and  XP2 
grooves,  these  structures  demonstrate  the  flexibility 
of  the  FE65  WW  grooves  to  accommodate  slightly 
different  positions  of  the  ligand  (Figure  3(d)).  Both 
the  FE65  WW  complexes  with  the  MnlO  and  Mn9 
peptides  are  stabilized  by  similar  interactions, 
although  subtle  differences  are  observed  in  the 
hydrophobic  contacts  and  hydrogen  bonding  net¬ 
works  of  the  two  structures  (Figure  3(e)  and  (f)). 
Most  notably,  the  carbonyl  oxygen  atoms  of  P6'  and 
P9'  in  MnlO  (P2r  and  P5'  in  Mn9)  form  direct 
hydrogen  bonds  with  the  hydroxyl  group  of  Y269 
and  the  Nel  of  W280,  respectively.  By  contrast,  the 
O71  atom  of  T278  hydrogen  bonds  directly  with  the 
carbonyl  oxygen  atom  of  the  spacer  residue  P7'  in 
the  FE65  WW-MnlO  complex  but  indirectly  through 
a  water  molecule  with  the  carbonyl  oxygen  atom  of 
P3'  in  the  FE65  WW-Mn9  structure. 

Similarities  between  the  FE65  WW  and  SH3 
ligand-binding  surfaces 

A  prominent  feature  of  the  FE65  WW-MnlO  and 
FE65  WW-Mn9  structures  is  the  parallel  orientation 
of  the  XP  and  XP2  grooves,  which  is  due  in  large 
part  to  the  W271  indole  ring  that  forms  the  external 
wall  of  XP2  (Figure  4(a)  and  (b)).  By  comparison, 
two  solution  structures  of  the  FBP11  WW1  bound  to 
the  peptide  PPLP  revealed  that  the  XP  and  XP2 


grooves  are  not  parallel  (Figure  4(c)).33'34  The  FBP11 
WW1  XP2  groove  is  formed  by  two  tyrosine 
residues  (Y23  and  Y25)  instead  of  a  tyrosine  and  a 
tryptophan  in  FE65  (Y269  and  W271),  and  the  FBP11 
grooves  are  nearly  continuous  because  of  the 
smaller  side-chain  of  S32  compared  to  the  T278  in 
FE65  WW.  As  a  result,  the  FBP11  WW1  grooves  are 
occupied  by  successive  pairs  of  residues,  without  a 
spacer  amino  acid.  In  contrast,  the  parallel  and  non- 
continuous  nature  of  the  FE65  WW  grooves  under¬ 
lies  the  recognition  of  a  ligand  having  two  pairs  of 
proline  residues  separated  by  one  spacer  amino 
acid,  the  side-chain  of  which  makes  no  contributions 
to  the  interaction. 

The  parallel  orientation  of  the  FE65  WW  grooves 
is  a  novel  structural  feature  of  group  II /III  domains 
and  is  remarkably  similar  to  the  arrangement  of  the 
SH3  proline-binding  grooves  (Figure  4(a)  and  (d)), 
despite  the  different  folds  of  these  protein  modules. 
Indeed,  superposition  of  the  FE65  WW-MnlO  and 
the  Abl  SH3-3BP1  crystal  structures  shows  strik¬ 
ingly  similar  positions  of  the  three  aromatic 
residues,  which  emanate  from  different  secondary 
structure  elements  in  the  WW  and  SH3  domains  to 
form  the  two  polyproline-binding  grooves  (Figure 
4(e)).  The  almost  identical  spatial  position  of  these 
critical  aromatic  triads  provides  strong  evidence 
that  these  evolutionarily  unrelated  domains  have 
converged  upon  a  similar  structural  mechanism  for 
PPII  recognition.29'46 

Perspective 

This  work  presents  the  first  crystal  structure  of  a 
group  II /III  WW  domain  and  provides  mechanistic 
insights  into  uninterrupted  polyproline  ligand 
recognition  by  FE65  WW  at  the  atomic  level.  A 
prominent  feature  of  this  domain  is  the  presence  of  a 
second  groove,  XP2,  which  is  parallel  with  the 
conserved  groove  XP  and  is  formed  by  an  induced- 
fit  mechanism.  The  parallel  and  non-continuous 
arrangement  of  the  XP  and  XP2  grooves  necessitates 
a  spacer  residue  between  the  two  pairs  of  proline 
residues,  leading  to  the  recognition  of  the  PPXPP 
sequence,  where  X  is  any  residue.  Remarkably,  the 
architecture  of  these  grooves  is  more  similar  to  that 
of  SH3  domains  than  FBP11  WW1,  the  only  other 
group  II /III  WW  domain  with  known  structure, 
providing  evidence  for  evolutionary  convergence. 
Thus,  the  high-resolution  crystal  structures  of  FE65 
WW  bound  to  polyproline  ligands  advances  our 
understanding  of  the  principles  underlying  ligand 
selection  by  this  versatile  domain.  This  informa¬ 
tion  will  facilitate  the  design  of  novel  WW  domains 
with  desired  ligand  specificities  and  affinities,  as 
described.47,48  Importantly,  the  specific  and  low- 
affinity  interactions  of  WW  domains  with  their 
polyproline  ligands  make  them  excellent  targets  for 
drug  development.45  Given  the  significance  of  the 
FE65  WW-mediated  protein  interactions  in  brain 
development  and  pathogenesis  of  human  diseases, 
including  Alzheimer's  disease,  the  present  work  will 
provide  a  framework  for  the  development  of  small 
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Figure  4.  (a)  Surface  representation  of  the  FE65  WW  bound  to  the  MnlO  peptide  (shown  as  a  stick  model). 
Semitransparent  rectangles  denote  the  parallel  orientation  of  the  XP  and  XP2  grooves,  (b)  A  diagram  of  the  FE65  WW 
bound  to  MnlO  PPII  helix  (shown  as  a  triangular  prism),  illustrating  the  parallel  arrangement  of  the  XP  and  XP2  grooves, 
the  three  aromatic  residues  that  form  them,  and  the  proline  residues  of  the  ligand  that  occupy  them  (orange  spheres). 
Unbound  proline  and  leucine  residues  are  shown  as  gray  and  cyan  spheres,  respectively,  (c)  Surface  representation  of  the 
FBP11  WW1  bound  to  PPLP  sequence  (PDB  code  2DYF),  showing  the  non-parallel  arrangement  of  the  XP  and  XP2 
grooves  (rectangles),  (d)  Surface  representation  of  the  Abl  tyrosine  kinase  SH3  domain  bound  to  the  peptide  3BP1 
containing  the  sequence  PPLPP  (PDB  code  1ABO),  showing  the  parallel  XP  and  XP2  grooves  (rectangles),  (e)  A  stereo 
view  of  the  FE65  WW-MnlO  structure  superimposed  on  the  Abl  SH3-3BP1  complex,  showing  the  similar  mode  of  PPII 
recognition  by  the  aromatic  triad  of  WW  (yellow)  and  SH3  (light  green)  domains.  The  MnlO  and  3BP1  peptides  are  shown 
in  cyan  and  pink,  respectively. 
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molecules  that  could  modulate  these  interactions 
and  regulate  the  function  of  FE65,  with  possible 
clinical  applications. 

Experimental  Procedures 

Protein  expression  and  crystallization 

A  DNA  fragment  encoding  the  human  FE65  WW 
(residues  253-289)  with  a  recognition  site  for  the  TEV 
protease  (ENLYFQjG)  at  its  N  terminus  was  amplified  by 
the  polymerase  chain  reaction  method  and  was  cloned 
into  a  modified  pGEX-2T  vector  (GE  Healthcare).  The 
FE65  WW  protein  was  expressed  in  E.  coli  BL21(DE3)  cells 
as  a  GST  fusion,  purified  on  glutathione  Sepharose, 
released  from  GST  by  digestion  with  TEV,  and  was  further 
purified  by  size-exclusion  chromatography  on  a  Superdex 
30  column  (GE  Healthcare).  SeMet-substituted  FE65  WW 
protein  was  produced  in  E.  coli  B834  cells  (Novagen) 
grown  in  M9  medium  supplemented  with  40  mg/1  of 
SeMet  (Sigma),  and  the  protein  was  purified  in  the  same 
manner  as  the  unlabelled  protein  in  the  presence  of  5  mM 
DTT  throughout  the  entire  purification  process. 

The  FE65  WW  protein  was  concentrated  by  ultracen¬ 
trifugation  to  40  mg/ml  in  20  mM  Tris-HCl  (pH  7.6), 
150  mM  NaCl,  and  was  used  for  crystallization  experi¬ 
ments  by  the  sitting-drop,  vapor-diffusion  method.  Crys¬ 
tals  of  FE65  WW  in  the  apo  form  were  obtained  in  0.1  M 
Hepes  (pH  7.5),  2.2  M  ammonium  sulfate,  2%  (w/v) 
PEG400  at  20  °C.  Crystals  were  cryoprotected  by  stepwise 
transfer  in  0.1  M  Hepes  (pH  7.5),  2.3  M  ammonium  sulfate, 
10%  (v/v)glycerol,  10%  (w/v)  xylitol.  The  crystals  belong 
to  space  group  P6322  with  unit  cell  dimensions 
a  =  b  =  75. 61  A,  c-  226.49  A,  a  =  b  =  90°,  y  =  120°.  There  are 
eight  molecules  in  the  asymmetric  unit.  The  crystallized 
FE65  WW  protein  includes  the  vector-derived  residue  G  at 
its  N  terminus. 

For  crystallization  of  FE65  WW-Mena  complexes,  the 
protein  concentration  was  adjusted  to  18  mg/ ml  in  20  mM 
Tris-HCl  (pH  7.6),  150  mM  NaCl,  mixed  with  the  peptides 
MnlO  or  Mn9  at  — 1:3  (protein/peptide)  molar  ratios, 
incubated  at  4  °C  for  1-2  h,  and  used  for  crystallization  by 
the  sitting-drop,  vapor-diffusion  method.  Crystals  of  the 
FE65  WW-MnlO  complex  grew  in  0.1  M  Hepes  (pH  7.4), 
1.7-1 .9  M  ammonium  sulfate,  at  20  °C,  and  they  were 
cryoprotected  in  0.1  M  Hepes  (pH  7.4),  1.8  M  ammonium 
sulfate,  20%  glycerol.  They  belong  to  space  group  P3i21 
with  unit  cell  dimensions  a  =  b  =  41.66  A,  c-  38.63  A, 
a  =  |3=90o,  y  =  120°,  and  have  one  molecule  in  the  asym¬ 
metric  unit.  Crystals  of  the  FE65  WW-Mn9  complex  grew 
in  0.1  M  Hepes  (pH  7.6),  1.3  M  sodium  citrate,  and  were 
cryoprotected  in  0.05  M  Hepes  (pH  7.6),  1.52  M  sodium 
citrate.  They  belong  to  space  group  P4x2!2  with  unit  cell 
dimensions  a  =  b  =  37.5Q  A,  c- 78.46  A,  a  =  b  =  7  =  90°,  and 
there  is  one  molecule  in  the  asymmetric  unit.  The  data  were 
processed  using  the  HKL2000  package.49  Data  collection 
and  processing  statistics  are  given  in  Table  1. 

Structure  determination  and  refinement 

The  crystal  structure  of  FE65  WW  in  the  apo  form  was 
determined  by  SAD  methods,  using  SeMet-substituted 


protein  and  data  collected  on  beamline  XI 2C  at  the 
National  Synchrotron  Light  Source  (NSLS),  Brookhaven 
National  Laboratory,  Long  Island,  NY.  The  structure  of  the 
FE65  WW-MnlO  complex  was  solved  by  MAD  methods, 
using  SeMet-substituted  FE65  WW  protein  and  data 
collected  at  NSLS,  beamline  X12B.  Initial  SAD  and  MAD 
phases  were  calculated  using  SHELX,50  followed  by 
several  cycles  of  model  building  and  refinement  using 
COOT,51  REFMAC5,52  and  ARP/wARP.53  The  FE65  WW- 
Mn9  structure  was  determined  by  molecular  replacement 
using  MOLREP,54  and  the  FE65  WW-MnlO  complex  as  a 
search  model.  PROCHECK55  was  used  for  analysis  and 
validation  of  the  refined  structures  (Table  1). 

Isothermal  titration  calorimetry 

Binding  constants  of  FE65  WW  for  the  synthetic 
peptides  Mn9  and  MnlO  were  measured  using  a  VP-ITC 
microcalorimeter  (MicroCal).  Briefly,  0.71  mM  Mn9  and 
0.74  mM  MnlO  were  titrated  against  63.9  pM  FE65  WW 
protein  in  20  mM  Tris-HCl  (pH  7.6),  150  mM  NaCl  at 
23  °C.  Titration  curves  were  analyzed  using  the  program 
ORIGIN  5.0  (OriginLab).  Protein  and  peptide  concentra¬ 
tions  were  determined  by  quantitative  amino  acid  analysis 
on  an  ABI  420 A  derivatizer/ analyzer  and  an  ABI  103 A 
separation  system  (Applied  Biosystems). 

Protein  Data  Bank  accession  codes 

The  atomic  coordinates  and  structure  factors  of  FE65 
WW  in  the  apo  form  and  bound  to  Mena  peptides  Mn9 
and  MnlO  have  been  deposited  in  the  Protein  Data  Bank 
with  accession  codes  2IDH,  20EI,  and  2H02,  respectively. 
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